Zadar County
PCARNN-DCBF: Minimal-Intervention Geofence Enforcement for Ground Vehicles
Yu, Yinan, Scheidegger, Samuel
Runtime geofencing for ground vehicles is rapidly emerging as a critical technology for enforcing Operational Design Domains (ODDs). However, existing solutions struggle to reconcile high-fidelity learning with the structural requirements of verifiable control. We address this by introducing PCARNN-DCBF, a novel pipeline integrating a Physics-encoded Control-Affine Residual Neural Network with a preview-based Discrete Control Barrier Function. Unlike generic learned models, PCARNN explicitly preserves the control-affine structure of vehicle dynamics, ensuring the linearity required for reliable optimization. This enables the DCBF to enforce polygonal keep-in constraints via a real-time Quadratic Program (QP) that handles high relative degree and mitigates actuator saturation. Experiments in CARLA across electric and combustion platforms demonstrate that this structure-preserving approach significantly outperforms analytical and unstructured neural baselines.
- North America > United States > California > San Francisco County > San Francisco (0.14)
- Asia > South Korea (0.14)
- Oceania > New Zealand > North Island > Auckland Region > Auckland (0.04)
- (14 more...)
- Information Technology (1.00)
- Automobiles & Trucks (1.00)
- Transportation > Ground > Road (0.94)
- (2 more...)
Charting the Future of Scholarly Knowledge with AI: A Community Perspective
Jiomekong, Azanzi, McGinty, Hande Küçük, Mills, Keith G., Oelen, Allard, Rajabi, Enayat, McElroy, Harry, Christou, Antrea, Saini, Anmol, Zebaze, Janice Anta, Kim, Hannah, Jacyszyn, Anna M., Auer, Sören
Scholarly work and communication encompass the entire system in which research and creative works are created, evaluated for quality, disseminated to the academic community and beyond, used, and preserved for future use. It includes formal publications, such as journal articles and books, as well as informal sharing through preprints, conference presentations, data sharing, and broader engagement with scholarly works and research outputs. Scholarly knowledge serves as the primary engine of progress, shaping our world and guiding our collective future. It forms the backbone of technological advancement, public health systems, and sustainable environmental practices. Obtained through rigorous methods of observation, experimentation, and validation, it is a reliable resource that helps societies solve complex problems and improve the quality of life by achieving sustainable development goals (SDGs) [6]. To be truly useful, scholarly knowledge must first be systematically extracted and organized. However, the scholarly community of today faces the problem of an overload of scientific papers in their respective domains. There is an increasing number of papers published every year (currently, 3 million), in addition to more than 200 million papers that have already been published . This gives rise to the research question: "How can we provide a reliable and living scholarly knowledge base that empowers researchers to query, synthesize, and analyze the vast body of scholarly knowledge?"
- North America > United States > Ohio > Montgomery County > Dayton (0.14)
- North America > United States > California > Los Angeles County > Los Angeles (0.14)
- Europe > Germany > Lower Saxony > Hanover (0.04)
- (14 more...)
Making Teams and Influencing Agents: Efficiently Coordinating Decision Trees for Interpretable Multi-Agent Reinforcement Learning
Chen, Rex, Milani, Stephanie, Zhang, Zhicheng, Sadeh, Norman, Fang, Fei
Poor interpretability hinders the practical applicability of multi-agent reinforcement learning (MARL) policies. Deploying interpretable surrogates of uninterpretable policies enhances the safety and verifiability of MARL for real-world applications. However, if these surrogates are to interact directly with the environment within human supervisory frameworks, they must be both performant and computationally efficient. Prior work on interpretable MARL has either sacrificed performance for computational efficiency or computational efficiency for performance. To address this issue, we propose HYDRA VIPER, a decision tree-based interpretable MARL algorithm. HYDRA VIPER coordinates training between agents based on expected team performance, and adaptively allocates budgets for environment interaction to improve computational efficiency. Experiments on standard benchmark environments for multi-agent coordination and traffic signal control show that HYDRA VIPER matches the performance of state-of-the-art methods using a fraction of the runtime, and that it maintains a Pareto frontier of performance for different interaction budgets.
- Europe > Germany > Bavaria > Upper Bavaria > Ingolstadt (0.06)
- North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
- North America > Canada (0.04)
- (15 more...)
Comparing Traditional and Reinforcement-Learning Methods for Energy Storage Control
Ginzburg, Elinor, Segev, Itay, Levron, Yoash, Keren, Sarah
We aim to better understand the tradeoffs between traditional and reinforcement learning (RL) approaches for energy storage management. More specifically, we wish to better understand the performance loss incurred when using a generative RL policy instead of using a traditional approach to find optimal control policies for specific instances. Our comparison is based on a simplified micro-grid model, that includes a load component, a photovoltaic source, and a storage device. Based on this model, we examine three use cases of increasing complexity: ideal storage with convex cost functions, lossy storage devices, and lossy storage devices with convex transmission losses. With the aim of promoting the principled use RL based methods in this challenging and important domain, we provide a detailed formulation of each use case and a detailed description of the optimization challenges. We then compare the performance of traditional and RL methods, discuss settings in which it is beneficial to use each method, and suggest avenues for future investigation.
- Energy > Power Industry (1.00)
- Energy > Energy Storage (1.00)
- Energy > Renewable > Solar (0.88)
Defect Detection in Photolithographic Patterns Using Deep Learning Models Trained on Synthetic Data
Shinde, Prashant P., Pai, Priyadarshini P., Adiga, Shashishekar P., Mayya, K. Subramanya, Seo, Yongbeom, Hwang, Myungsoo, Go, Heeyoung, Park, Changmin
In the photolithographic process vital to semiconductor manufacturing, various types of defects appear during EUV pattering. Due to ever-shrinking pattern size, these defects are extremely small and cause false or missed detection during inspection. Specifically, the lack of defect-annotated quality data with good representation of smaller defects has prohibited deployment of deep learning based defect detection models in fabrication lines. To resolve the problem of data unavailability, we artificially generate scanning electron microscopy (SEM) images of line patterns with known distribution of defects and autonomously annotate them. We then employ state-of-the-art object detection models to investigate defect detection performance as a function of defect size, much smaller than the pitch width. We find that the real-time object detector YOLOv8 has the best mean average precision of 96% as compared to EfficientNet, 83%, and SSD, 77%, with the ability to detect smaller defects. We report the smallest defect size that can be detected reliably. When tested on real SEM data, the YOLOv8 model correctly detected 84.6% of Bridge defects and 78.3% of Break defects across all relevant instances. These promising results suggest that synthetic data can be used as an alternative to real-world data in order to develop robust machine-learning models.
- North America > United States > Nevada > Clark County > Las Vegas (0.04)
- South America > Brazil > Rio de Janeiro > Niterói (0.04)
- North America > United States > California > Los Angeles County > Long Beach (0.04)
- (4 more...)
- Semiconductors & Electronics (1.00)
- Information Technology > Hardware (0.34)
Patent Figure Classification using Large Vision-language Models
Awale, Sushil, Müller-Budack, Eric, Ewerth, Ralph
Patent figure classification facilitates faceted search in patent retrieval systems, enabling efficient prior art search. Existing approaches have explored patent figure classification for only a single aspect and for aspects with a limited number of concepts. In recent years, large vision-language models (LVLMs) have shown tremendous performance across numerous computer vision downstream tasks, however, they remain unexplored for patent figure classification. Our work explores the efficacy of LVLMs in patent figure visual question answering (VQA) and classification, focusing on zero-shot and few-shot learning scenarios. For this purpose, we introduce new datasets, PatFigVQA and PatFigCLS, for fine-tuning and evaluation regarding multiple aspects of patent figures~(i.e., type, projection, patent class, and objects). For a computational-effective handling of a large number of classes using LVLM, we propose a novel tournament-style classification strategy that leverages a series of multiple-choice questions. Experimental results and comparisons of multiple classification approaches based on LVLMs and Convolutional Neural Networks (CNNs) in few-shot settings show the feasibility of the proposed approaches.
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
- North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.14)
- Europe > Austria > Vienna (0.14)
- (19 more...)
AdaVLN: Towards Visual Language Navigation in Continuous Indoor Environments with Moving Humans
Loh, Dillon, Bednarz, Tomasz, Xia, Xinxing, Guan, Frank
Visual Language Navigation is a task that challenges robots to navigate in realistic environments based on natural language instructions. While previous research has largely focused on static settings, real-world navigation must often contend with dynamic human obstacles. Hence, we propose an extension to the task, termed Adaptive Visual Language Navigation (AdaVLN), which seeks to narrow this gap. AdaVLN requires robots to navigate complex 3D indoor environments populated with dynamically moving human obstacles, adding a layer of complexity to navigation tasks that mimic the real-world. To support exploration of this task, we also present AdaVLN simulator and AdaR2R datasets. The AdaVLN simulator enables easy inclusion of fully animated human models directly into common datasets like Matterport3D. We also introduce a "freeze-time" mechanism for both the navigation task and simulator, which pauses world state updates during agent inference, enabling fair comparisons and experimental reproducibility across different hardware. We evaluate several baseline models on this task, analyze the unique challenges introduced by AdaVLN, and demonstrate its potential to bridge the sim-to-real gap in VLN research.
- Information Technology > Artificial Intelligence > Robots (1.00)
- Information Technology > Artificial Intelligence > Natural Language (1.00)
- Information Technology > Artificial Intelligence > Machine Learning (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.68)
Analysis of Visual Features for Continuous Lipreading in Spanish
Gimeno-Gómez, David, Martínez-Hinarejos, Carlos-D.
In our case, we employed a traditional approach to define the automatic system, in other words, a system based on Hidden During a conversation, our brain is responsible for combining Markov Models combined with Gaussian Mixture Models information obtained from multiple senses in order (GMM-HMM), an approach that has been widely used in to improve our ability to understand the message we are Acoustic Speech Recognition (ASR) [6]. Although this is not perceiving. Different studies have shown the importance of the state-of-the-art for speech-related signal recognition, it is presenting visual information in these situations. Nevertheless, an appropriate option for comparing the different possibilities lipreading is a complex task whose objective is to interpret for feature extraction. Unlike in ASR, when we deal with Visual speech when audio is not available. By dispensing with a sense Speech Recognition (VSR) our basic speech unit is not the as crucial as hearing, it will be necessary to be aware of the phoneme, but the one known as the viseme, which is associated challenge that this lack presents. In this paper, we propose an with the representation of the phoneme on the visual domain analysis of different speech visual features with the intention [7]. Unfortunately, there is not direct or one-to-one correspondence of identifying which of them is the best approach to capture between them, which causes visual ambiguities.
- Europe > United Kingdom > England (0.04)
- Europe > Spain > Valencian Community > Valencia Province > Valencia (0.04)
- Europe > Spain > Aragón > Zaragoza Province > Zaragoza (0.04)
- Europe > Croatia > Zadar County > Zadar (0.04)
- Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
- Information Technology > Artificial Intelligence > Vision > Image Understanding (0.72)
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.50)
Review on Monitoring, Operation and Maintenance of Smart Offshore Wind Farms
Kou, Lei, Li, Yang, Zhang, Fangfang, Gong, Xiaodong, Hu, Yinghong, Yuan, Quande, Ke, Wende
In recent years, with the development of wind energy, the number and scale of wind farms have been developing rapidly. Since offshore wind farms have the advantages of stable wind speed, being clean renewable, non-polluting, and the non-occupation of cultivated land, they have gradually become a new trend in the wind power industry all over the world. The operation and maintenance of offshore wind powe has been developing in the direction of digitization and intelligence. It is of great significance to carry ou research on the monitoring, operation, and maintenance of offshore wind farms, which will be of benefit fo the reduction of the operation and maintenance costs, the improvement of the power generation efficiency improvement of the stability of offshore wind farm systems, and the building of smart offshore wind farms This paper will mainly summarize the monitoring, operation, and maintenance of offshore wind farms, with particular focus on the following points: monitoring of "offshore wind power engineering and biological and environment", the monitoring of power equipment, and the operation and maintenance of smart offshore wind farms. Finally, the future research challenges in relation to the monitoring, operation, and maintenance of smart offshore wind farms are proposed, and the future research directions in this field are explored especially in marine environment monitoring, weather and climate prediction, intelligent monitoring of powe equipment, and digital platforms.
- Europe > North Sea (0.14)
- Pacific Ocean > North Pacific Ocean > South China Sea (0.04)
- Asia > China > Shanghai > Shanghai (0.04)
- (35 more...)
Real-time Interface Control with Motion Gesture Recognition based on Non-contact Capacitive Sensing
Lee, Hunmin, Mandivarapu, Jaya Krishna, Ogbazghi, Nahom, Li, Yingshu
Capacitive sensing is a prominent technology that is cost-effective and low power consuming with fast recognition speed compared to existing sensing systems. On account of these advantages, Capacitive sensing has been widely studied and commercialized in the domains of touch sensing, localization, existence detection, and contact sensing interface application such as human-computer interaction. However, as a non-contact proximity sensing scheme is easily affected by the disturbance of peripheral objects or surroundings, it requires considerable sensitive data processing than contact sensing, limiting the use of its further utilization. In this paper, we propose a real-time interface control framework based on non-contact hand motion gesture recognition through processing the raw signals, detecting the electric field disturbance triggered by the hand gesture movements near the capacitive sensor using adaptive threshold, and extracting the significant signal frame, covering the authentic signal intervals with 98.8% detection rate and 98.4% frame correction rate. Through the GRU model trained with the extracted signal frame, we classify the 10 hand motion gesture types with 98.79% accuracy. The framework transmits the classification result and maneuvers the interface of the foreground process depending on the input. This study suggests the feasibility of intuitive interface technology, which accommodates the flexible interaction between human to machine similar to Natural User Interface, and uplifts the possibility of commercialization based on measuring the electric field disturbance through non-contact proximity sensing which is state-of-the-art sensing technology.
- North America > United States > Georgia > Fulton County > Atlanta (0.05)
- Asia > Taiwan > Taiwan Province > Taipei (0.04)
- Asia > South Korea > Busan > Busan (0.04)
- (11 more...)